Abstract


The non-central chi-square distribution arises in various statistical analyses. The estimation of the non-centrality parameter of the distribution is of importance in some problems. In this paper it is shown that the maximum likelihood estimator is inadmissible with respect to the squared error loss function. It is trivially minimax since all estimators have unbounded maximum risk. A class of estimators is given which are admissible and minimax for a modified loss function.


The author's work was supported by the Office of Naval Research
under Contract N00014-75-C

AMS 1970 Subject Classification 62C15.


Key words and phrases: Chi-square Distribution, Non-Centrality Parameter, Maximum Likelihood, Admissible and Minimax Estimators.


1. Introduction. The non-central chi-square distribution arises in various statistical analyses, such as the analysis of variance and Pearson's chi-square test for goodness of fit. A discussion of various applications of the distribution is given in Johnson & Kotz ([3], §28.9). For an example in electrical engineering, Spruill (19.9) has shown that the measurement of electrical power in a circuit is related to the estimation of the non-centrality parameter of a chi-square distribution.

Let X be distributed according to the non-central chi-square distribution with p degrees of freedom and non-centrality parameter equal to $\lambda$. It is known that X-p is a uniformly minimum variance unbiased estimator (UMVUE) of $\lambda$. Perlman and Rasmussen (1975) have shown that a class of estimators, given by


(i.l) 


5  (X)  = 


P  2  5 ,  0  <  b  <  4 (p-r) 


has uniformly smaller mean squared error (MSE) than the UMVUE. Neff and Strawderman (1976) have extended the class to the family of estimators, given by


(1.2) 


c(X)  =  v-p+— 


Oi  J 

<  a  <  V 


(1-3) 


i(x)  =  x-p-4- 


n  >  a 


where b and c are positive numbers.

It is interesting to observe that the UMVUE has smaller MSE than the estimator X+p, which is a Bayes estimator with respect to an improper prior distribution (see de Waal (1974) and Perlman and Rasmussen (1975)).




On the other hand, the UMVUE is itself inadmissible, as it is dominated by $(X-p)^+$, the positive part of X-p, given by

(1.4)  $(X-p)^+ = \begin{cases} 0, & X \le p \\ X-p, & X > p. \end{cases}$
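The domination of the UMVUE by its positive part can be illustrated with a small Monte Carlo sketch. This is not part of the original analysis: the function names, the seed, and the number of draws are assumptions chosen for illustration, and the sampler relies on the standard representation of a non-central chi-square variable as a Poisson mixture of central chi-square variables.

```python
import math
import random


def sample_poisson(mean, rng):
    """Knuth's multiplication method; adequate for the small means used here."""
    limit = math.exp(-mean)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def sample_noncentral_chisq(p, lam, rng):
    """Draw X with p degrees of freedom and non-centrality lam, so E[X] = p + lam."""
    k = sample_poisson(lam / 2.0, rng)
    # A central chi-square with p + 2k degrees of freedom is Gamma(shape=(p+2k)/2, scale=2).
    return rng.gammavariate((p + 2 * k) / 2.0, 2.0)


def compare_mse(p, lam, n_draws=20000, seed=0):
    """Return Monte Carlo MSEs of X - p and of (X - p)^+ on the same draws."""
    rng = random.Random(seed)
    se_umvue = 0.0
    se_positive_part = 0.0
    for _ in range(n_draws):
        x = sample_noncentral_chisq(p, lam, rng)
        se_umvue += (x - p - lam) ** 2
        se_positive_part += (max(x - p, 0.0) - lam) ** 2
    return se_umvue / n_draws, se_positive_part / n_draws


if __name__ == "__main__":
    for lam in (0.0, 1.0, 5.0):
        print(lam, compare_mse(4, lam))
```

Because $\lambda \ge 0$, the positive part is never farther from $\lambda$ than X-p on any single draw, so the second average cannot exceed the first; the gap is largest near the origin.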


Here and throughout the following, admissibility is tacitly defined with respect to the squared error (SE) loss function. Let $c \ge 0$. We shall show that $(X-c)^+$ is inadmissible for c < p and that there are no two values of c > p for which one estimator dominates the other.

Comparing the MSE of the estimators given by (1.2) and (1.3) with that of $(X-p)^+$ near the origin, it is seen that none of them dominates $(X-p)^+$. It is not known whether any of these estimators is admissible. On the other hand, we shall show that all estimators have unbounded maximum risk (MSE). Therefore, all estimators are trivially minimax with respect to the squared error loss. However, if the loss function is changed to $(SE)/(\lambda+\varepsilon) = L_\varepsilon$, say, where $\varepsilon$ is a positive number, then there exists a class of estimators with bounded maximum risk.

We have derived a class of proper Bayes estimators which are shown to be minimax with respect to $L_\varepsilon$, and also admissible.


Meyer (1967) has considered the maximum likelihood estimator (MLE) for p = 3. We show that the MLE is inadmissible for $p \ge 1$ but minimax with respect to $L_\varepsilon$. Also, we consider the derivation of the MLE in the general case when the sample size is larger than 1.

The question of admissibility of the estimator $(X-p)^+$ is of special significance. We have not been able to establish its admissibility (inadmissibility). However, we conjecture that the estimator is admissible.


2.  Admissible  minimax  estimators.  The  following  results  pertain 
to  the  non-central  gamma  distribution  whose  density  function  is  given 
by 


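(2.1)  $f_\lambda(x) = e^{-\lambda-x}\, x^{p-1} \sum_{r=0}^{\infty} \frac{(\lambda x)^r}{\Gamma(p+r)\, r!} = e^{-\lambda-x} \left(\frac{x}{\lambda}\right)^{(p-1)/2} I_{p-1}(2\sqrt{\lambda x}), \quad x > 0,$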


where $I_p(x)$ denotes the modified Bessel function with parameter p.

Let X be distributed according to the distribution (2.1); then 2X is distributed according to the chi-square distribution with 2p degrees of freedom and non-centrality parameter $2\lambda$.
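The two forms of the non-central gamma density, the Poisson-weighted series and the Bessel-function form, can be compared numerically. The sketch below is only an illustration under stated assumptions: the function names, the truncation lengths, and the integration grid are choices made here, and the Bessel function is evaluated from its power series, which is adequate only for moderate arguments.

```python
import math


def bessel_i(nu, z, terms=200):
    """Modified Bessel function I_nu(z), z > 0, from its power series."""
    total = 0.0
    for r in range(terms):
        log_term = ((2 * r + nu) * math.log(z / 2.0)
                    - math.lgamma(r + 1) - math.lgamma(nu + r + 1))
        total += math.exp(log_term)
    return total


def density_series(x, p, lam, terms=120):
    """Series form: exp(-lam - x) x^(p-1) sum_r (lam x)^r / (Gamma(p+r) r!)."""
    total = 0.0
    for r in range(terms):
        total += math.exp(r * math.log(lam * x)
                          - math.lgamma(p + r) - math.lgamma(r + 1))
    return math.exp(-lam - x) * x ** (p - 1) * total


def density_bessel(x, p, lam):
    """Bessel form: exp(-lam - x) (x/lam)^((p-1)/2) I_{p-1}(2 sqrt(lam x))."""
    z = 2.0 * math.sqrt(lam * x)
    return math.exp(-lam - x) * (x / lam) ** ((p - 1) / 2.0) * bessel_i(p - 1, z)


def mass_and_mean(p, lam, upper=40.0, steps=4000):
    """Midpoint-rule estimates of the total mass and the mean of the density."""
    h = upper / steps
    mass = 0.0
    mean = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        f = density_series(x, p, lam)
        mass += f * h
        mean += x * f * h
    return mass, mean


if __name__ == "__main__":
    print(density_series(3.2, 2.5, 1.7), density_bessel(3.2, 2.5, 1.7))
    print(mass_and_mean(2.5, 1.7))  # mass close to 1, mean close to p + lam
```

With p = 2.5 and $\lambda$ = 1.7 the numerical mean should be close to p + $\lambda$ = 4.2, in line with the moments of the distribution.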

We consider an a priori distribution for $\lambda$ which is a mixture of gamma distributions with $\nu$ degrees of freedom and scale factor c, with $\theta = (1+c)^{-1}$ being distributed according to a beta distribution, given by the density function

$h(\theta) = \frac{1}{B(\alpha,\beta)}\, \theta^{\alpha-1} (1-\theta)^{\beta-1}, \quad 0 < \theta < 1.$

Given c, the conditional density function of $\lambda$ is given by

(2.2)  $g_{c,\nu}(\lambda) = \frac{c^{\nu}}{\Gamma(\nu)}\, \lambda^{\nu-1} e^{-c\lambda}.$


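Let

$\Phi(a, b; x) = 1 + \frac{a}{b}\, x + \frac{a(a+1)}{b(b+1)} \frac{x^2}{2!} + \cdots$

denote the confluent hypergeometric function, and let

$F(a, b; c, d; x) = \sum_{r=0}^{\infty} \frac{\Gamma(a+r)\,\Gamma(b+r)}{\Gamma(c+r)\,\Gamma(d+r)} \frac{x^r}{r!}$

denote the generalized hypergeometric function. The posterior mean of $\lambda$ is given by

(2.3)  $\hat\lambda = \frac{\nu \int_0^1 \theta (1-\theta)^{\nu}\, \Phi(\nu+1, p; \theta x)\, h(\theta)\, d\theta}{\int_0^1 (1-\theta)^{\nu}\, \Phi(\nu, p; \theta x)\, h(\theta)\, d\theta} = \frac{F(\nu+1, \alpha+1; p, \alpha+\beta+\nu+1; x)}{F(\nu, \alpha; p, \alpha+\beta+\nu; x)}.$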

The posterior mean given by (2.3) is a Bayes estimator of $\lambda$ with respect to the squared error loss and the given prior distribution. The Bayes estimator is admissible since the prior distribution assigns positive probability measure to every open interval.

From (2.3) and the formula for the asymptotic expansion of the generalized hypergeometric function (see e.g., Wright [8]) the value of $\hat\lambda$ for large x is given by

(2.4)  $\hat\lambda = x\left(1 - \frac{p+2\beta}{x} + O(x^{-2})\right).$


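From (2.1) it is seen that the distribution of X is stochastically increasing in $\lambda$, and its moments are given by $EX = p+\lambda$, $\mathrm{var}(X) = p+2\lambda$,

$E(X^{-1}) = \frac{\Gamma(p-1)}{\Gamma(p)}\, e^{-\lambda}\, \Phi(p-1, p; \lambda) = \lambda^{-1} + O(\lambda^{-2})$ for large values of $\lambda$.

Therefore,

(2.5)  $\mathrm{MSE}\ \hat\lambda = p + 4\beta^2 + 2\lambda + O(\lambda^{-1/2}).$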


Consider a gamma prior for $\lambda$, given by the density function (2.2) with c being a fixed positive number. The posterior mean with respect to this distribution is given by

(2.6)  $\delta_c = \frac{\nu\, \Phi(\nu+1, p; x/(1+c))}{(1+c)\, \Phi(\nu, p; x/(1+c))} = \frac{p}{1+c} + \frac{x}{(1+c)^2}$ for $\nu = p$.


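From (2.6) we have

(2.7)  $\mathrm{MSE}\ \delta_c = (1+c)^{-4}\left[p + 2\lambda + (2+c)^2 (p - c\lambda)^2\right]$

and

(2.8)  $\int_0^\infty (\mathrm{MSE}\ \delta_c)\, g_{c,p}(\lambda)\, d\lambda = (1+c)^{-4}\left[p + \frac{2p}{c} + p(2+c)^2\right] = \rho(c)$, say,

where $g_{c,p}$ is the gamma density (2.2) with $\nu = p$.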
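The closed forms for the MSE of the linear Bayes rule $p/(1+c) + x/(1+c)^2$ and for its average over a gamma prior with shape p and rate c can be checked with a few lines of code. The sketch below is an illustration only: the function names and the integration settings are assumptions made here, and the check uses nothing beyond $EX = p+\lambda$ and $\mathrm{var}(X) = p+2\lambda$.

```python
import math


def mse_delta_c(lam, p, c):
    """Closed-form MSE: (1+c)^-4 [p + 2 lam + (2+c)^2 (p - c lam)^2]."""
    return (p + 2.0 * lam + (2.0 + c) ** 2 * (p - c * lam) ** 2) / (1.0 + c) ** 4


def mse_from_moments(lam, p, c):
    """Variance plus squared bias of p/(1+c) + X/(1+c)^2."""
    variance = (p + 2.0 * lam) / (1.0 + c) ** 4
    bias = p / (1.0 + c) + (p + lam) / (1.0 + c) ** 2 - lam
    return variance + bias ** 2


def bayes_risk(p, c):
    """Closed-form average risk: (1+c)^-4 [p + 2p/c + p (2+c)^2]."""
    return (p + 2.0 * p / c + p * (2.0 + c) ** 2) / (1.0 + c) ** 4


def bayes_risk_numeric(p, c, steps=20000):
    """Midpoint-rule integral of the MSE against the Gamma(shape=p, rate=c) density."""
    upper = (p + 40.0) / c  # far in the tail of the prior
    h = upper / steps
    log_norm = p * math.log(c) - math.lgamma(p)
    total = 0.0
    for i in range(steps):
        lam = (i + 0.5) * h
        prior = math.exp(log_norm + (p - 1.0) * math.log(lam) - c * lam)
        total += mse_delta_c(lam, p, c) * prior
    return total * h


if __name__ == "__main__":
    for c in (0.1, 0.01, 0.001):
        print(c, bayes_risk(3.0, c))  # grows without bound as c decreases
```

The term 2p/c drives the average risk to infinity as c tends to 0, which is the mechanism behind the unbounded maximum risk under squared error loss.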


Since $\delta_c$ is a Bayes estimator and $\rho(c) \to \infty$ as $c \to 0$, it follows that all estimators have unbounded maximum risk, and are therefore trivially minimax with respect to the squared error loss. Hence, $\hat\lambda$ and $\delta_c$ are admissible and minimax.


As shown above, all estimators have unbounded maximum risk with respect to the squared error loss. Therefore, we consider the loss function $L_\varepsilon$ and a new prior distribution for $\lambda$, given by the density function $k(\lambda+\varepsilon)\, g_{c,p}(\lambda)$, where k is a normalizing factor, equal to $c/(p+c\varepsilon)$. Clearly, $\delta_c$, given by (2.6), is Bayes with respect to $L_\varepsilon$ and the new prior distribution, with respect to which the average loss is equal to $c\rho(c)/(p+c\varepsilon)$. Since


The inequality in (2.9) shows that $(X-c)^+$ is inadmissible for c < p.

From (2.10) it is seen that $M_c'(\lambda)$ is negative for $c \le p$ and that it tends to 0 as $c \to \infty$. The quantity inside the square bracket on the right hand side of (2.11) is negative for $\lambda < 1$ and is increasing in c (by Lemma A2 of the Appendix), tending to $(\lambda-1)\lambda^{-1}$ as $c \to \infty$. Therefore $M_c(\lambda)$ is concave in c for $\lambda < 1$ and for $\lambda > 1$ it is first concave then convex as c varies from 0 to $\infty$. It follows that for each $\lambda$ there is a unique value of $c = c_\lambda\ (> p)$, say, for which $M_c(\lambda)$ is minimized (given by $M_c'(\lambda) = 0$) as c varies from 0 to $\infty$, where $c_\lambda = \infty$ for $\lambda \le 1$. Moreover, for each c > p there exists a value of $\lambda > 1$, equal to $\lambda_c$, say, such that $M_c'(\lambda_c) = 0$, since $M_c'(0) < 0$ and $M_c'(\infty) = 2(c-p) > 0$. Thus there exists a $\lambda_c$ such that $M_c(\lambda_c) = \min_{c'} M_{c'}(\lambda_c)$.

Let $c' > c > p$. From the preceding results we have that the inequality $M_c(\lambda) > M_{c'}(\lambda)$ holds for $\lambda < 1$ and that $M_c(\lambda_c) < M_{c'}(\lambda_c)$. Therefore neither of the estimators $(X-c)^+$ and $(X-c')^+$ dominates the other.

Let $h_c(\lambda) = M_p(\lambda) - M_c(\lambda)$. It is easy to show that $h_c(\infty) < 0$ and $h_c(0) > 0$. Therefore between the estimators $(X-c)^+$ and $(X-p)^+$ neither dominates the other for any c > p.


From (2.9) it is seen that the maximum risk of $(X-c)^+$ with respect to $L_\varepsilon$ is bounded above by 2 for sufficiently large values of $\varepsilon$. Therefore $(X-c)^+$ is minimax with respect to $L_\varepsilon$. We summarize the preceding results in the following theorem.

Theorem 2.2. The estimator $(X-c)^+$ is minimax for c > 0 with respect to $L_\varepsilon$ for sufficiently large values of $\varepsilon$. It is inadmissible for c < p.


The  class  of  estimators  A_  is  irreducible. 




3. Maximum likelihood estimator. Some properties of the Bessel function are given in the Appendix. The given results will be used below to show the inadmissibility and the minimax property of the MLE. Equating to 0 the derivative with respect to $\lambda$ of the density function given by (2.1), we get

(3.1)  $1 = \sqrt{x/\lambda}\; I_p(2\sqrt{\lambda x}) / I_{p-1}(2\sqrt{\lambda x}).$


Let $\lambda^*$ denote the MLE. The quantity on the right hand side of (3.1) is monotone decreasing in $\lambda$ by Lemma A2 of the Appendix, and tends to x/p as $\lambda \to 0$. Therefore, $\lambda^* = 0$ for $x \le p$, and for x > p it is uniquely given as a solution of (3.1).
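Since the right hand side of the likelihood equation, $\sqrt{x/\lambda}\, I_p(2\sqrt{\lambda x})/I_{p-1}(2\sqrt{\lambda x})$, decreases monotonically from x/p as $\lambda$ grows, the MLE for a single observation can be located by bisection. The following is a minimal sketch, not the author's procedure: the function names and tolerances are assumptions, the Bessel ratio is computed from the two power series (suitable for moderate arguments), and the upper bracket $\lambda = x$ assumes $p \ge 1/2$, for which the ratio of Bessel functions is below 1.

```python
def likelihood_rhs(x, p, lam, max_terms=100000):
    """sqrt(x/lam) * I_p(z) / I_{p-1}(z) with z = 2 sqrt(lam x).

    Written as x * B(u) / A(u), u = lam * x, where
    A(u) = sum_r u^r / (r! Gamma(p+r)) and B(u) = sum_r u^r / (r! Gamma(p+r+1)),
    both rescaled by Gamma(p) so that the first terms are 1 and 1/p.
    """
    u = lam * x
    a_term, b_term = 1.0, 1.0 / p
    a_sum, b_sum = a_term, b_term
    for r in range(max_terms):
        a_term *= u / ((r + 1) * (p + r))
        b_term *= u / ((r + 1) * (p + r + 1))
        a_sum += a_term
        b_sum += b_term
        if a_term < 1e-17 * a_sum:
            break
    return x * b_sum / a_sum


def mle(x, p, tol=1e-12):
    """MLE of lam from one observation x: 0 if x <= p, else the root of rhs = 1."""
    if x <= p:
        return 0.0
    lo, hi = 0.0, x  # rhs(0) = x/p > 1 and, for p >= 1/2, rhs(x) < 1
    while hi - lo > tol * max(1.0, x):
        mid = 0.5 * (lo + hi)
        if likelihood_rhs(x, p, mid) > 1.0:
            lo = mid  # still above 1, so the root lies to the right
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    for x in (2.0, 5.0, 10.0, 50.0):
        print(x, mle(x, 2.5))
```

For p = 2.5 the computed values should fall strictly between x - p and x - p + 1 whenever x > p, which gives a quick sanity check on the solver.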
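From (3.1), we have

(3.2)  $\lambda^* = \sqrt{\lambda x}\; I_p(2\sqrt{\lambda x}) \big/ I_{p-1}(2\sqrt{\lambda x})$

$= \sqrt{\lambda x}\; I_p(2\sqrt{\lambda x}) \Big/ \left(\frac{p}{\sqrt{\lambda x}}\, I_p(2\sqrt{\lambda x}) + I_{p+1}(2\sqrt{\lambda x})\right)$

$> \sqrt{\lambda x}\; I_p(2\sqrt{\lambda x}) \Big/ \left(\frac{p}{\sqrt{\lambda x}}\, I_p(2\sqrt{\lambda x}) + \frac{I_p^2(2\sqrt{\lambda x})}{I_{p-1}(2\sqrt{\lambda x})}\right)$

$= \sqrt{\lambda x} \Big/ \left(\frac{p}{\sqrt{\lambda x}} + \sqrt{\frac{\lambda}{x}}\right),$

with $\lambda = \lambda^*$ on the right hand side.

The above inequality follows from Lemma A3. From Lemma A1 and the first equality in (3.2) it follows that $\lambda^* < x$, and from the last equality it follows that $\lambda^* > x-p$. Thus we have the interesting inequality

(3.3)  $x - p < \lambda^* < x$ for $x > p$.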


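Let $z = 2(\lambda^* x)^{1/2}$ and write $\lambda$ for $\lambda^*$. Then

(3.4)  $\frac{z}{2} \frac{dz}{dx} = \lambda + x \frac{d\lambda}{dx}$

and from (3.1)

$z\, I_{p-1}(z) = 2x\, I_p(z)$

or

(3.5)  $\sum_{r=0}^{\infty} \frac{(z^2/4)^r}{\Gamma(p+r)\, r!} = x \sum_{r=0}^{\infty} \frac{(z^2/4)^r}{r!\, \Gamma(p+r+1)}.$

Differentiating both sides of (3.5) with respect to x, we get after simplification

(3.6)  $\frac{z}{2} \frac{dz}{dx} = \frac{\lambda}{\lambda - x + p}.$

Hence from (3.4) and (3.6) we get

(3.7)  $x \frac{d\lambda}{dx} = \lambda \left(\frac{1}{\lambda - x + p} - 1\right).$

Differentiating (3.7) with respect to x we get

(3.8)  $x \frac{d^2\lambda}{dx^2} = \frac{d\lambda}{dx}\left(\frac{1}{\lambda-x+p} - 2\right) - \frac{\lambda\,(d\lambda/dx - 1)}{(\lambda-x+p)^2}.$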

Consider the behavior of $\lambda$ as a function of x. From the second

The above relation shows that $\lambda \to 0$ and $\lambda/(x-p) \to (p+1)/p$ as $x \to p+0$.

Then from (3.7) we have that $d\lambda/dx \to (p+1)/p > 0$ as $x \to p+0$. Let $x_0$ denote the smallest value of x > p for which $d\lambda/dx = 0$. Then $d^2\lambda/dx^2 \le 0$ for $x = x_0$. On the other hand, from (3.8) it is seen that $d^2\lambda/dx^2 > 0$ at $x = x_0$. Therefore, $d\lambda/dx > 0$ for all x > p. Hence, from (3.7) we have that $\lambda^* < x-p+1$.

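Let $x^0$ denote the smallest value of x > p for which $d\lambda/dx = 1$. From (3.7), the value of $\lambda$ at $x = x^0$ is given by

(3.9)  $x = \lambda \left(\frac{1}{\lambda - x + p} - 1\right)$

or

(3.10)  $2\lambda = 1 - p + \left((2x-p)^2 - 2p + 1\right)^{1/2}.$

Therefore

(3.11)  $1 - 2(\lambda - x + p) = 2x - p - \left((2x-p)^2 - 2p + 1\right)^{1/2} > 0$ for $p > \tfrac12$.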


Putting $d\lambda/dx = 1$ in (3.8) and using (3.9) we get, after simplification,

(3.12)  $x (\lambda - x + p)^2 \frac{d^2\lambda}{dx^2} = (\lambda - x + p)(1 - 2\lambda + 2x - 2p) > 0$ by (3.3) and (3.11).

Since $d\lambda/dx \to (p+1)/p > 1$ as $x \to p+0$, it follows that $d^2\lambda/dx^2 \le 0$ at $x = x^0$, contrary to (3.12). Therefore $d\lambda/dx > 1$ for all x > p. Hence, $\lambda^* - (x-p)^+$ is increasing in x. From this result it follows that $\lambda^*$ is inadmissible, as shown below.

Let $p > \tfrac12$. We have

(3.13)  $\mathrm{MSE}\ \lambda^* - \mathrm{MSE}\ (x-p)^+ = E\left[(\lambda^* - (x-p)^+)(\lambda^* + (x-p)^+ - 2\lambda)\right]$

$\ge E(\lambda^* - (x-p)^+)\; E(\lambda^* + (x-p)^+ - 2\lambda)$

$> 0.$

The first inequality in (3.13) follows from the fact that each of the quantities $\lambda^* - (x-p)^+$ and $\lambda^* + (x-p)^+ - 2\lambda$ is increasing in x. The second inequality follows from (3.3) and from $E(\lambda^* + (x-p)^+ - 2\lambda) > 2E(x - p - \lambda) = 0$.

Thus, the MLE is inadmissible, being dominated by the estimator $(x-p)^+$.

From the asymptotic expansion of the Bessel function we can show that for large x

$\lambda^* = x - p + \tfrac12 + O(1/x).$

Hence, for large values of $\lambda$

$\mathrm{MSE}\ \lambda^* = 2\lambda + O(1).$

Therefore, $\lambda^*$ is minimax with respect to $L_\varepsilon$ for sufficiently large values of $\varepsilon$. We summarize the preceding results in the following theorem.


Theorem 3.1. The maximum likelihood estimator $\lambda^*$ is uniquely determined by (3.1) for x > p and is equal to 0 for $x \le p$. Moreover, $X-p < \lambda^* < X-p+1$. The MLE is inadmissible, being dominated by $(X-p)^+$, and is minimax with respect to $L_\varepsilon$ for sufficiently large values of $\varepsilon$.

Theorem 3.1 pertains to the MLE for a sample of size n = 1. We consider now the case n > 1. Let $x_1, \ldots, x_n$ be sample values, and let $\lambda_n^*$ denote the MLE. The likelihood equation for the MLE is given by

(3.14)  $n = \sum_{i=1}^{n} \sqrt{x_i/\lambda}\; I_p(2\sqrt{\lambda x_i}) \big/ I_{p-1}(2\sqrt{\lambda x_i}).$

Each term of the summation on the right hand side of (3.14) is monotone decreasing in $\lambda$ by Lemma A2 and the maximum value of the sum, corresponding to $\lambda \to 0$, is equal to $n\bar{x}/p$, where $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$ denotes the sample mean. Therefore, $\lambda_n^* = 0$ for $\bar{x} \le p$ and for $\bar{x} > p$ the value of $\lambda_n^*$ is uniquely given as a solution of (3.14).

From (3.14) and Lemma A1 we have

$n < \sum_{i=1}^{n} \sqrt{x_i} \big/ \sqrt{\lambda}.$

Therefore

(3.15)  $\lambda_n^* < \left(\frac{1}{n} \sum_{i=1}^{n} \sqrt{x_i}\right)^2.$


For a lower bound on the value of $\lambda_n^*$ we have from (3.14)

$n\sqrt{\lambda} = \sum_{i=1}^{n} \sqrt{x_i}\; I_p(2\sqrt{\lambda x_i}) \big/ I_{p-1}(2\sqrt{\lambda x_i})$


\  A-  f 


I  ( 2/Xx”.  )  -1 

p+lv  I  - 


.  =  1  A  /Xx~  I  (2/Xx7) 

i  n  i 


i.l.  "P  l®*!  /xi  Vitvxxp/! 

l=i  /  \  1=1  1  r 


xj) 


by  Jensen’s  inequality 


2,-np  .  v 


1/xI)'(~^  +  t_,vXi  V2/hi}/Ip  /2'Axi)) 


/x  i  =  l 


by  Lemma  A  3 


Appendix 


Let $Q_p(x) = I_p(x)/I_{p-1}(x)$ denote the ratio of two Bessel functions.

Lemma A 1. $Q_p(x) < 1$ for p > 1 and x > 0.

Proof: From the series expansion of the modified Bessel function the inequality $Q_p(x) < 1$ reduces to

(1)  $\sum_{r \ge 0} \frac{(x/2)^{2r}}{r!\, \Gamma(p+r)} - \frac{x}{2} \sum_{r \ge 0} \frac{(x/2)^{2r}}{r!\, \Gamma(p+r+1)} > 0.$

Denoting the left hand side of (1) by $\sum_r a_r$, where $a_r$ denotes the (r+1)th term of the series, we

for p > 1


Lemma A 2. $x Q_p(x)$ is increasing in x and $Q_p(x)/x$ is decreasing in x.


Proof: We have

$x Q_p(x) = 2 \left(\frac{x}{2}\right)^2 \sum_{r \ge 0} \frac{(x^2/4)^r}{r!\, \Gamma(p+r+1)} \Big/ \sum_{r \ge 0} \frac{(x^2/4)^r}{r!\, \Gamma(p+r)} = 2E(J)$

where J denotes a discrete random variable whose distribution is given by

$P\{J = r\} = \frac{(x^2/4)^r}{r!\, \Gamma(p+r)} \left(\sum_{m \ge 0} \frac{(x^2/4)^m}{m!\, \Gamma(p+m)}\right)^{-1}, \quad r = 0, 1, \ldots$

Since the distribution of J has the monotone likelihood ratio property in x, it follows that E(J) is increasing in x. That $Q_p(x)/x$ is decreasing in x follows from the preceding relation and the recurrence relation

$I_{p-1}(x) = \frac{2p}{x}\, I_p(x) + I_{p+1}(x).$

Lemma A 3. $Q_p(x) > Q_{p+1}(x)$ for all p and x > 0.

Proof: We have

(2)  $(x^{-p} I_p(x)) / (x^{-p+1} I_{p-1}(x)) = Q_p(x)/x.$

The derivative with respect to x of the quantity on the left hand side of (2) is negative, since $Q_p(x)/x$ is decreasing in x by Lemma A 2. Therefore

$x^{-p} I_p(x)\, \frac{d}{dx}\left(x^{-p+1} I_{p-1}(x)\right) > x^{-p+1} I_{p-1}(x)\, \frac{d}{dx}\left(x^{-p} I_p(x)\right)$

or

$I_p(x)\, I_p(x) > I_{p+1}(x)\, I_{p-1}(x)$

by Formula 9.6.28 of Abramowitz and Stegun [1]. Hence

$Q_p(x) > Q_{p+1}(x).$
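The three lemmas on the Bessel ratio $I_p(x)/I_{p-1}(x)$ lend themselves to a numerical spot check on a grid. The sketch below is an illustration under stated assumptions, not a proof: the function names and the grid are choices made here, the ratio is computed from the power series of both Bessel functions, and the final helper uses the known identity $I_{1/2}(x)/I_{-1/2}(x) = \tanh x$ as an independent check of the series code.

```python
import math


def bessel_ratio(p, x, max_terms=100000):
    """Q_p(x) = I_p(x) / I_{p-1}(x) for x > 0, from the two power series."""
    u = x * x / 4.0
    a_term, b_term = 1.0, 1.0 / p  # series for I_{p-1} and I_p, rescaled by Gamma(p)
    a_sum, b_sum = a_term, b_term
    for r in range(max_terms):
        a_term *= u / ((r + 1) * (p + r))
        b_term *= u / ((r + 1) * (p + r + 1))
        a_sum += a_term
        b_sum += b_term
        if a_term < 1e-17 * a_sum:
            break
    return (x / 2.0) * b_sum / a_sum


def lemmas_hold(p, xs):
    """Check Q_p < 1, x Q_p increasing, Q_p / x decreasing, and Q_p > Q_{p+1} on xs."""
    q = [bessel_ratio(p, x) for x in xs]
    q_next = [bessel_ratio(p + 1.0, x) for x in xs]
    below_one = all(v < 1.0 for v in q)
    ordered_in_p = all(a > b for a, b in zip(q, q_next))
    increasing = True
    decreasing = True
    for i in range(len(xs) - 1):
        if not xs[i] * q[i] < xs[i + 1] * q[i + 1]:
            increasing = False
        if not q[i] / xs[i] > q[i + 1] / xs[i + 1]:
            decreasing = False
    return below_one and increasing and decreasing and ordered_in_p


def half_order_gap(x):
    """Absolute difference between the series value of Q_{1/2}(x) and tanh(x)."""
    return abs(bessel_ratio(0.5, x) - math.tanh(x))


if __name__ == "__main__":
    grid = [0.1 * k for k in range(1, 201)]
    print([lemmas_hold(p, grid) for p in (1.5, 2.0, 3.5)])
    print(half_order_gap(1.3))
```

A grid check of this kind can only fail to find a counterexample; it does not replace the arguments given in the proofs.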

